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Model-based and Data-driven Analytic Support 
for Strategy Development 

5 TECHNICAL FIELD 

The invention relates to strategy optimization. More particularly, the invention 
relates to model-based and data-driven analytic support for strategy 
development. 

10 

DESCRIPTION OF THE PRIOR ART 

The prior art relevant to this invention is in the field of decision analysis. 

15 Decision analysis is the quantitative study of a decision situation to gain 
insight and to recommend at least one action. In decision analysis, a 
mathematical model (a decision model) of the decision situation is developed 
using probability theory and utility theory. An influence diagram is one way to 
represent a decision model. 

20 

The following sections describe the italicized terms above. 



Decision situation 



WO 02/084925 PCT/US02/1 1537 

A decision situation is a process in which an individual or organization must 
make a decision, or commit to at least one action. It may involve a single 
decision, such as a person's choice of which car to buy. Alternatively, it may 
involve similar decisions applied to all individuals in a population, such as a 
5 bank's choice of which credit card customers to offer a higher credit limit. 

A common thread to almost every decision is that there is uncertainty about 
the future outcomes resulting from the decision. In the car buyer example, 
there is uncertainty whether a particular model of car performs well over time; 
10 in the credit limit example, there is uncertainty about whether a particular 
account uses the extra credit or goes bankrupt, among other possible 
outcomes. 

Almost every decision must be made under uncertainty, on the basis of 
15 predictions or estimates of future outcomes. There is a distinction between 
the actual results and the expected results predicted by the model. The 
decision is made based on the predicted results. 

Decision model 

20 A decision model is a mathematical description of a decision situation. In 
developing a decision model, an analyst must describe the relevant 
parameters of the situation clearly enough to quantify them. In particular, the 
uncertain outcomes must be clearly specified and their uncertainties 
quantified by probability distributions. 

25 
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In decision situations involving multiple cases, such as the credit limit 
example, one uncertainty that must be quantified is the makeup and character 
of the target population over which the decision is expected to be applied. In 
many cases, for instance in a credit card application situation, the target 
5 population is unknown at the time the decision model is made. 

A decision model consists of a set of variables and the relationships between 
them. The set of variables includes decision variables, value variables, 
decision key variables, and other uncertain variables. 

10 

The decision variables represent at least one action to select in each case. 
Often, a decision is modeled as a categorical, discrete, or nominal variable 
that can take on any value from a finite set of feasible actions. There may be 
multiple decision variables in a decision model. 

15 

The value variables represent the quantity that must be optimized in the 
optimization. If there are multiple value variables in a decision model, the 
objective function of the optimization is the sum of the values of all value 
variables. 

20 

The decision key variables represent quantities that are known or observed at 
the time the decision is to be made. 

The other uncertain variables are all variables that are not decision, value, or 
25 decision key variables. 
Influence diagram 
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An influence diagram graphically represents a decision model, with nodes or 
shapes representing variables in the model, and arcs or arrows between 
nodes representing relationships between variables. Models are built and 
5 edited graphically by the user within a graphical user interface. 

An influence diagram is composed of graphical elements of the following four 
types: 

10 • Decision node, which is a square node representing a decision variable. 

• Value node, which is a hexagonal or octagonal node representing a value 
variable. 

• State node, which is a circular or elliptical node representing any other 
variable 

15 • Arc, which is a directed arrow between two nodes representing a 
relationship between the two corresponding variables 

Influence diagrams comprise a technology that has been known in the art 
since the 1960's. It would be desirable to provide improvements to the field of 
strategy optimization that extend the influence diagram. 

20 

SUMMARY OF THE INVENTION 

The invention provides an overall architecture for optimal strategy design 
using both historical data and human expertise. The presently preferred 

4 
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architecture supports the tasks of strategy design and strategy analysis, and 
provides extensions to Influence diagrams, translation of an influence diagram 
as a nonlinear optimization problem, and induction of decision rules from the 
optimization result. 

5 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a block schematic diagram showing data-driven and model-based 
strategy optimization according to the invention; 

10 

Fig. 2 is a graph showing a decision key space according to the invention; 

Fig. 3 is a graph in which strategy shown over decision key space according 
to the invention; 

15 

Fig. 4 is a tree diagram showing an equivalent decision tree according to the 
invention; 

Fig. 5 is a block schematic diagram showing a manual strategy design 
20 according to the invention; 

Fig. 6 is a block schematic diagram showing a strategy design with data 
according to the invention; 



25 Fig. 7 is a block schematic diagram showing a model-based strategy design 
according to the invention; and 
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Fig. 8 is a block schematic diagram showing a model-based strategy design 
with data according to the invention. 

5 DETAILED DESCRIPTION O F THE INVENTION 

The invention provides an overall architecture for optimal strategy design 
using both historical data and human expertise. The presently preferred 
architecture supports the tasks of strategy design and strategy analysis, and 
10 provides extensions to influence diagrams, translation of an influence diagram 
as a nonlinear optimization problem, and induction of decision rules from the 
optimization results. 

Extensions to Influence diagrams 

15 Strategy optimization provides the following unique extensions to influence 
diagram methodology: 

Constrained optimization. The ability to place constraints on an 
optimization in an influence diagram. 

20 Dataset-based optimization. The ability to optimize over an explicitly 
specified set of cases (the representative population), as well as over a set of 
cases implicitly specified by the joint prior distributions of the decision keys. 

Mapping the decision model to a dataset file. The ability to map the 
variables in an influence diagram to the columns of a dataset. 
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The translation of an influence diagram as a nonlinear optimization 
problem 

Strategy optimization translates an influence diagram as a nonlinear 
5 optimization problem. This allows the influence diagram to contain continuous 
variables with complex functional forms, the specification of high-dimensional 
representative populations, and constraints. 

The Induction of decision rules from the optimization results 

10 After the nonlinear optimization has obtained an optimization solution over the 
cases in the representative population, it uses induction techniques to derive 
decision rules that are applicable to the target population. The induction 
techniques used create strategies that are understandable, editable, and 
implementable In existing decision engine software systems. 

15 

Methodology Overview 

The methodology for using strategy optimization comprises the following 
steps: 

20 

56. Modeling a decision situation. A strategy optimization influence 
diagram and a representative population are created to model the 
decision situation mathematically. The strategy optimization influence 
diagram is formatted as a nonlinear optimization problem over a 
25 representative population. 
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57. Formulating a strategy optimization decision model as a nonlinear 
optimization problem. Nonlinear optimization techniques are used to 
assign at least one action to each case in the representative population. 
A strategy table is created containing all cases in the representative 

5 population, along with the assigned action for each case. 

58. Developing a strategy using induction techniques on the results of 
an optimization. Induction techniques are applied to the strategy table 
to derive a set of decision rules, i.e. a strategy, from the data. The set of 
rules induced can then be operationalized through a decision engine, 

10 such as those manufactured by Fair, Isaac Company of San Rafael, 

California (Fair Isaac). Fair Isaac has several decision engine products, 
including the TRIAD, Decision System, and StrategyWare decision 
engine products. 

The above steps are described in the following discussion. 
15 Modeling a decision situation 

The Strategy Optimizer formulates and specifies a decision situation as a 
decision model, using a modeling language based on the technology of 
influence diagrams. The following discussion describes the strategy 
20 optimization modeling language, including extensions to existing technologies 
that are unique to strategy optimization. 

Constrained optimization. In Strategy Optimizer, the user can associate a 
set of constraints with the model. This is important because real-world 
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decision situations almost always include significant constraints on the actions 
of the decision maker. Constraints require trade-offs between the results of 
acting in individual cases. Without constraints, optimal solution for each case 
is considered independently of the solution for all other cases. Other 
5 influence diagram optimize the decision in an unconstrained way, limiting their 
application to decision situations without significant constraints. 

Dataset-based optimization, in strategy optimization, the user can optimize 
over an explicitly specified set of cases, called the representative population. 
Other Bayesian network techniques optimize only over a set of cases 
10 implicitly specified by the joint prior distributions of the decision keys (strategy 
optimization can be applied in this way). The advantage of this extra 
functionality is that strategy optimization can be applied to situations where 
joint prior probabilities do not exist or are problematic to define. 

Mapping the decision model to a dataset file. This feature is essential to 
15 the dataset-based optimization described above. The user can map (or 
relate) each decision key in the decision model with a column in a dataset. 
No other Bayesian network techniques we know of boasts this feature. 

Formulating a strategy optimization decision model as a nonlinear 
optimization problem 

20 

One objective of the invention is to formulate a nonlinear optimization problem 
that corresponds exactly to a decision model specified in the modeling 
language of strategy optimizer. Once expressed this way, the optimal solution 
to the decision model can be determined by using integer programming. 
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The following discussion provides a formal notation for a decision model. 
The following notational conventions are followed: 

• Subscripts are indicated by an underscore, e.g. A J means "A subscript 

i" 

• Subscript i is used for cases and quantities corresponding to cases 

• Subscript j is used for constraints and quantities corresponding to 
constraints 

• A sum is indicated by the notation, "Sumjc in C) {f(c)} w , which means 
"the sum of the quantities computed by applying function f to each of 
the cases c in C" 

• Sets are indicated by upper-case letters 

• The number of elements in a set is indicated by capital N subscripted 
by the name of that set 

A decision model consists of the following elements: 

• A "decision variable" D with set A of possible alternative actions a_J , a_2, 
.,.,a_N_A(N_A>==1) 

• A set E of "decision key variables" E_1 , E_2, E_N_E that are known or 
observed at the time the decision is to be made for each case. (NLE >= 0) 
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• A set V of "value variables" Vjl , V_2, . , V_N_V. (N_V >= 1 ) 

• A set U of "portfolio input variables" IM, U_2, • • • $ LLN_U (NJJ >= 0). 
The joint state space of U, or the set of all possible instantiations of 
portfolio variables, is called M. There is a joint distribution P(mll=Mm) 
defined over all m in M. 

• A set K of "case-level constraints" K_1, K_2, K_N_K (N_K>=0) in one 
of two different forms: 

• "If q_j(E), then x in S j", where qj is a logical predicate that is either 
true or false and that is a function of the decision key variables, and S J 
is a subset of A 

• "h J(E, x) < d J", where hj is a function that evaluates to a real number 
and is a function of the decision and the decision key variables, and d J 
is a constant 

© A "representative population" or "portfolio" C of cases cj ...c_N_C (N_C 
>= 1). There is a set of sample weights W (wj, w_2, wJM_C), with 
each element w J corresponding to case cj. Because of this one-to-one 
correspondence, each element w_i can also be notated w(c_j). 

• A "dataset mapping" between the representative population and the 
decision model such that each case represents an instantiation of the 
decision key variables E_t , E_2, E_N_E 

• A set X of variables x_1, x_2, .... xJiJC. Each variable xj represents at 
least one action to assign to the case cj. Due to this one-to-one 
correspondence, the variable x_i is sometimes notated x(c_i). 
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• A set T of "portfolio-level constraints" T_1 , T_2, T_N_T (N_T >= 0) of 
the form e GJ(C,X,W) <= bj\ where bj is a constant and the functional 
forms for G J(C,X,W) include, but are not limited to: 

• The sum of some function g(c, x(c)) over all cases in C, or Sum_(c in 
C){w(c)*g(c, x(c))} 

• The mean of some function g(c, x(c)) over all cases in C, or the ratio of 
Sum Jc in C) { w(c) * g(c, x(c)) } to Sum_(c in C) { w(c) } 

• The mean or sum of some function g(c, x(c)) over a subset of cases in 
C 

• The sum of some function g2(c, x(c), m) over all cases in C for all 
possible instantiations m of the portfolio variables In U, or Sumjc in C, 
m in M) { P(U-m) * w(c) * g2(c, x(c), m) } 

Input to the procedure 

The input to the procedure is a decision model, as described above. 
Formulation procedure 

There are three steps in formulating the given decision, model as a nonlinear 
optimization problem: 

Step 1: Specify the cases in the representative population 

Step 2: Construct the feasible actions for the decision for each case 
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Step 3: Specify the objective function of the optimization 
These steps are described in the following discussion: 
Ste p 1 • Specify the cases in the re presentative population 

5 The procedure is: Select one of the methods for specifying a set of cases. 

There are a number of ways to specify the cases in the representative 
population, including but not limited to those listed here: 

• Use a set of cases from a previously encountered decision situation 

• Create, design, or specify a set cases manually 

10 • Draw N_C samples from the joint distribution of decision key variables in 
the decision model using Monte Carlo methods. This requires a joint 
probability distribution be defined over the decision key variables. 

• Enumerate all possible cases. This requires that the set of possible 
combinations of the values of the decision keys be finite. 

15 ste p 2: Constnmt the feasib le anions for the decision for each case 

The set of feasible actions for each case is notated A_i and is a function of the 
case-level constraints. 

20 The procedure is: For each case c_i 

• Instantiate the decision keys E according to that case 
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• For each case-level constraint KJ, compute the set A_ij of feasible 
decisions. 

• For constraints of type "If q_j(E), then x in SJ" , compute the logical 
predicate qJ(E). If it is true, then set A_jj to SJ. If it is false, then set 
5 A_ij to A (all possible actions are feasible) 

. For constraints of type "hJ(E, x) < dj", set A_ij to the empty set. For 
each alternative action a in A, compute the value of hJ(E, a). If hJ(E, 
a) < dj, then add a to AJj 

. Set A_i to be the set of all actions in A that are feasible for all constraints 
10 (that is, A_i is the intersection of the sets AJj) 

ca ft p a- S pecify the oh jective fu nction of the optimization 

The procedure is: Select one of the functional forms for the objective function 
F(C,X,W). 

15 The functional forms for F(C,X,W) include, but are not limited to those listed 
here. 

The sum of some function f(c, x(c)) over all cases in C, or Sumjcin C) { w(c) 
. M(c.x(c))} 

. The mean of some function f(c, x(c)) over all cases in C, or the ratio of 
20 Sum Jc in C) { w(c) * f(c, x(c)) } to Sum_(c in C) { w(c) } 

• The mean or sum of some function f(c, x(c)) over a subset of cases in C 
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• The sum of some function f2(c, x(c), m) over all cases in C for all possible 
instantiations m of the portfolio variables in U, or Sum_(c in C, m in M) { 
P(U=m) * w(c) M(c, x(c), m) } 

Often, the component function f(c, x(c)) is the sum of the value of the value 
5 variables, evaluated at each case. This Is computed for each case c by 
setting the decision keys in the decision model according to c, then setting the 
decision according to x(c), then evaluating all variables in the network, then 
evaluating the sum V_1 + V_2 + . + VJvLV. 

Result of the procedure 

10 

The result of this process is a nonlinear optimization problem stated as: 

• Select values for all variables x_i in X to maximize F(C,X,W) 

• Subject to the constraints: 

15 • xj is in AJ (for all i in 1 to N_C) 

G J(C,X,W) <= bj (for all j in 1 to NJT) 

Once formulated as a nonlinear optimization problem, the decision model can 
be optimized by using integer programming, as is known to those skilled in the 
art. 

20 

The output of the optimization procedure Is a strategy table, i.e. a flat-file 
dataset with a row for each case cj in the representative population C, one 

15 
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column for each decision key of case c_j, and a column for the optimal action 
x*j assigned to case c _i. 

Developing a strategy using induction techniques on the results of an 
optimization 

5 

One unique feature of the strategy optimization methodology is the method for 
developing strategies by applying induction techniques to the results of an 
optimization over a representative population. The following discussion 
describes this feature in detail, describing the process itself, and summarizing 
10 the advantages that the strategy optimization methodology provides over 
current methods. 



Procedure 

15 The induction problem can be expressed as follows: 

Find a rule R defined over the target population such that R(c_i) 
accurately specifies the value x*J for the cases cj in the 
representative population C, using the strategy table to help develop 
20 the rule R. 

Strategy optimization uses a form of the CART algorithm to perform this 
induction step, resulting in a strategy represented as a decision tree. 
However, other induction techniques can be used as well, leading to other 
25 representations for a strategy, e.g. fuzzy logic rules. 
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There are several things to note about this induction step: 

• The rule R uses some known quantities (the decision keys of cj) to select 
5 another quantity (the assigned actions). 

• The most accurate decision rule with respect to the representative 
population is a one for which r(cj) is the same as x* J, for each cj in C. 

• In many situations, a more complex rule is more accurate on the 
representative population than a simpler rule. However, often a simpler 

10 rule is preferred over a complicated rule because it is easier for a human 
subject matter expert to comprehend. Complexity can be measured by 
many ways, such as the length in characters of a computer program that 
implements a rule. 

• The preferred embodiment of the invention requires a rule R that is 
15 accurate, not 100% faithful. There may be reclassifications in which 

R(c_j) does not match x*J for individual cases, but R is supposed to 
represent the values accurately overall. 

Misclassifications may be acceptable as a trade off for a simpler rule. For 
instance, consider two rules for the same decision situation. Rule R1 assigns 
20 the same action to all cases in a certain segment. Rule R2 assigns the same 
action to all cases in a certain segment, with seven exceptions. Rule R1 may 
be preferred to Rule R2 because it is simpler, even though rule R2 
misclassifies those seven exceptional cases and may have a lower computed 
objective function than rule R1 because of it. 



17 



WO 02/084925 



PCTAJS02/11537 



Advantages 

Some key advantages of this approach, i.e. using induction after optimization, 
include: 

5 

59. One can develop strategies in decision situations where one or more of 
the decision keys are continuous quantities such as "income." In these 
situations, it is impossible to create a representative population that 
includes all possibl e cases. 

10 60. The resulting strategies can be described concisely to a client business 
executive. 

61. The resulting strategies are easier to understand, easier to edit, and 
more likely to provide insight to a domain expert than the raw strategy 
table containing thousands of individual cases. This allows the 

15 application of expert judgment and domain expertise to adjust the raw 
results of the optimization. 

62. The resulting strategies can be less susceptible to the problem of 
"overfitting" to the representative population. Overfitting is a 
phenomenon where a strategy works well on the population it was 

20 developed with, but works poorly on other similar populations. 

A main design requirement of the induction step is to provide output in a form 
that is easily digestible by decision engine software. Because all of the 
decision engine software uses tree constructs, it is preferred to induce trees 
from the strategy table. 
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An Exemplary Embodiment 

An organizing concept of the preferred embodiment is model-based and data 
driven strategy optimization (MDDDSO). This is shown in Fig. 1, in which 

5 strategies 10 are developed based upon the application of historical and 
designed data 12, and analyst judgement 11 to an action-based predictive 
modeling module 13. A decision model structure 14 results, which may be 
further revised by human input, i.e. that of analyst judgement 11. The 
decision model structure is used to produce the decision model 15. The 

10 decision model is subjected to both simulation 16 and optimization 17 by 
which strategies are evaluated and optimized. The simulation results 19 may 
be separately output. The optimized decision model is provided to a strategy 
editor .1 8 for further revision via analyst judgement 1 1 , arid resulting strategies 
10 are output for application to real world problem solving. 

15 

The MDDDSO supports the development of strategies given both data and 
human expertise. Much attention has been given to: 1) allowing user and 
expert interaction in all phases of strategy development, as well as 2) allowing 
the system to run in an automated fashion if deemed useful. Human 
20 expertise is captured both through models and in interactive guidance of the 
model building and strategy induction processes. Given data, the strategy 
optimization invention both generates strategies and revises them. 

The presently preferred embodiment of the invention provides a smooth 
25 transition from current strategy analysis techniques to the full MDDDSO 
capabilities. The presently preferred embodiment includes such capabilities 
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as a user-driven decision tree editor for use in strategy profiling and data- 
driven strategy development, provide the first steps away from current 
strategy design practice. The presently preferred embodiment of the 
invention also allows the evaluation of strategies against simulation models 
5 and the sole use of expert judgement to build models from which strategies 
can be derived. 

Strategy Design 

10 The following discussion provides background on strategy design. 

Organizations and objectives 

An organization seeks to achieve specific objectives, such as the 
15 maximization of profit, by making decisions. In domains such as e-business 
or credit card application processing where a decision situation is to be 
evaluated over a large number of cases, the decision making for individual 
cases often is delegated to an automated agent, such as a decision engine 
running on a computer. A decision is delegated to an agent by designing a 
20 strategy, which is a set of rules specifying the decision for the agent to make 
in any possible case. 

Automated strategies involving initial credit limit assignment, cross-selling, or 
other account management decisions are common. For the most part, these 



20 



WO 02/084925 PCT/US02/1 1537 

strategies are designed through expert judgement with no explicit assessment 
of how the strategies achieve the organization's objectives. 



Decision Situations 

5 A strategy is designed for execution in a decision situation. For example, Fair 
Isaac's TRIAD strategies are designed for account management decisions 
within a real-time mainframe environment. The kinds of strategies and 
decision-making capabilities needed differ with the decision situation. The 
major dimensions of a decision situation are: 

10 

• Who are the decision makers? Is it a single person or a group of 
people? If it is a group what is the responsibilities of each person in the 
group. 

15 • What is the relation of the decision to other decisions that the decision 
maker faces? Is the decision independent of other decisions or 
dependent? 

• Is the decision a one-off decision or a repetitive decision? Is the 
20 decision going to be made only once or is it going to be made hundreds 

or millions of times? 

• What is the magnitude of what's at stake in the decisions? Are the 
stakes large or small on each decision? 

25 
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• What is the availability of data on analogous decisions? Is there 
historical data that records the circumstances and values historical 
decisions? 

5 • What is the availability of expertise on the decisions or on components 
of the decisions? Is there significant expertise on this decision or 
components of the decision or is no human expertise available? 

• For repetitive decisions, what is the rate that the underlying situations 
10 change over time? 

• What is the time horizon over which the decisions re evaluated? 

• What is the amount of time available to make each decision? 

15 

• What is the amount of interaction that a decision maker has in making 
the actual decision? 

• How quickly does the decision/strategy need to be put together? 

20 

• How much and how important is the uncertainty in the decision 
situation? 

A few example decision situations are as follows: 

25 
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• The typical customer management situation: many repetitive 
decisions with relatively little at stake in each decision. There is a 
significant amount of uncertainty in the decision situation, e.g. 
good/bad, revenue, and attrition. Historical data are typically available, 
5 as well as expertise in making predictions about important variables in 

the situation. The rate at which the underlying situation changes is on 
the order of months. The time horizon over which each decision is 
evaluated is 6-1 8 months. 

10 • The strategic corporate decision environment: a single decision that 
is very high stakes, such as the acquisition terms in a corporate 
takeover. A small group of well-coordinated decision makers. The time 
horizon over which the decision is evaluated may be long, e.g. quarters, 
years. 

15 

• The strategic public policy decision environment a single decision 
that is very high stakes and for which there is a diverse set of 
stakeholders with significantly different objectives. An example is the 
siting of a nuclear waste disposal site. The time horizon over which the 

20 decision is evaluated may be very long, e.g. many years. 

• The Internet advertising environment: many repetitive decisions but 
no human expertise and no human intervention in the decision making 
process. Time horizon for evaluating the decision is very short, e.g. 

25 click-through rate. 
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Usually there is a significant amount of low level information about a decision 
situation. For example, in account management, one has all the low-level 
data in the master file and in the credit bureau to consider in addition to 
whatever other information might be available, e.g. account application 
5 information. One of the first tasks in building an optimal strategy is to define 
the high-level features, i.e. decision keys, on which the strategy is based. 
Defining the features and developing algorithms to compute them from the 
low-level information is known in artificial intelligence as the signal-to-symbol 
transformation and is critical to the process of building optimal strategies. 

10 Strategies 

A central concept of the invention is that of a strategy. A strategy maps the 
observable characteristics of a situation to at least one action that the client 
can take. The typical situation analyzed is an account or customer of the 
15 client, but it could be an entire portfolio. Cases are described by a set of 
decision keys, each of which has an attribute space of allowable values. The 
decision key space is the cross product of the attribute spaces of ail decision 
keys. The decision key space represents the entire set of allowable 
situations. A point in the decision key space represents a specific situation, 
20 i.e. each characteristic in the situation description takes on a unique value. 

The set of legal actions is called the action space. As with situations, there 
can be dimensions of the action space. For example, in making a loan offer, 
the dimensions of the loan offer includes APR, % down required, terms of the 
25 loan, etc. 



24 



WO 02/084925 PCT/US02/11537 

A deterministic strategy is a strategy that for each point in the decision key 
space designates a single point in action space. In contrast, probabilistic 
strategies associate a single point in decision key space with some probability 
distribution of points in action space. Probabilistic strategies, e.g. optimal 
5 strategies in bandit, are often useful in gaming situations and in repetitive 
decision situations where learning about the situation is an important 
component of the objective function. Champion-challenger is a probabilistic 
strategy. 

10 If the decision key space is discrete, the most common representation for a 
strategy is a table. To each point in the decision key space, at least one 
action is assigned from actionspace. A more sophisticated representation for 
a strategy over a discrete decision key space is a tree that coalesces common 
regions of space together. 

15 

If the decision key space is continuous, it is not possible to lists all the values 
of the decision key space in a table. The typical representation is the decision 
tree, which partitions the decision key space into segments in such a way that 
all the points in each segment have the same optimal action. In a decision 

20 tree, each branch of the tree represents a specific volume of the decision key 
space and has associated with it at least one specific action from the action 
space. There are many other ways that strategies can be represented. In 
fact, any functional form that can map from the decision key space to action 
space can represent a strategy, e.g. neural networks, computer programs, 

25 and tables. 



Action Segments 
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A decision tree segments a population into subpopulations, with one 
subpopulation for each leaf or segment of the tree. The subpopulation is 
defined by the splits or branches made between the root of the tree and the 
S leaf. Each branch, ie. interior node, of the tree has associated with it a split 
variable and a split value. The split variable is the independent variable that is 
splitting the population at the node. r 

In the strategy process it is important to define action segments, or segments 
10 to which a single action is the right thing for everybody in that segment. 
Action segments typically have a clear conceptual definition behind them. For 
example, in an attrition strategy one might want to treat accounts differently as 
a function of their attrition likelihood and profit potential, as listed in Table 1 . 



Table 1: An attrition strategy 



Segment 


Attrition 
Likelihood 


Profit 
Potential 


Action 


1 


High 


High 


aggressively defend and 
rebuild customer relationship 


2 


Low 


High 


build and sustain customer 
relationship 


3 


High 


Low 


low cost maintenance 


4 


Low 


Low 


choose to lose 



Historical Data 
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There are two kinds of historical data that are based on strategy execution: 
performance data and strategy data. Performance data describes situation 
information, e.g. account information, as well as information about the 

5 performance of that account, e.g. good/bad, revenue, attrition, etc., with 
respect to defined performance measures. These performance measures do 
not have to be the bottom-line measures, e.g. profit, but could be. Strategy 
data describes a situation (x) and gives the optimal action (a) for that 
situation. Strategy data is typically derived from models or historically through 

10 the development of historical action functions. 

Example 

The following is a simple example to illustrate the concepts presented above. 
Assume a typical decision situation. Assume a decision situation having 
15 many repetitive decisions with relatively little at stake in each decision. There 
is a significant amount of uncertainty in the decision situation, e.g. good/bad, 
revenue. 

Assume that the decision key space is made up of three dimensions: a risk 
20 score, a revenue score, and delinquency, e.g. 0, 1+ cycles. Assume that the 
scores are continuous-valued between 200 and 800. The decision key space 
can be pictured as shown in Fig. 2. 

Assume that the action space is a single dimension that is a credit line 
25 adjustment decision. Assume that there are three alternatives to credit line 
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adjustment are: decrease to current balance + 10%, no change and increase 
credit line by 10%. 



In this context, a strategy is any function that maps a particular set of values 
5 of risk score, revenue score, and delinquency, i.e. a point in this decision key 
space, to one of the actions, B CL = decrease", "CL = no change", W CL = 
increase," i.e. a point in the action space. An example strategy is shown 
below in rule format: 

10 If an account is delinquent (1+ cycles), CL = decrease. 

Else if, an account is not delinquent and the risk score is greater than 

720 and the revenue score is greater than 720 then increase 
Else, CL = No Increase 



15 The strategy can also be shown in decision key space (Fig. 3), and in a 
decision tree (Fig. 4). 

One surprising fact about strategies and decision trees is that every strategy 
can be represented in a large number of equivalent structures, e.g. different 

20 decision trees. In the example above, one could add an additional segment 
"revenue score," e.g. 0-360, 360-720, and 720-800, but as long as the 0-360 
and the 360-720 scenarios have "No Change" associated with them, the 
strategy is the same. How to choose a structure? A general rule of the thumb 
is to minimize the number of segments in a strategy, or the number of rules, 

25 used to describe a strategy. That is, it is desirable to have as many segments 
as necessary but no more. 
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If the segments are chosen such that one or more segments contain situation 
points that have different optimal actions, an optimal strategy is not possible. 
For example, assume that the strategy shown in Figs. 3 and 4 are optimal If 
5 the revenue score and the risk score are segmented by the set (200-400, 400- 
600, 600-800) only a crude approximation to the optimal strategy is possible. 

In general, there is a trade-off between performance and interpretability of 
strategies. Interpretability is a function of the number of segments, e.g. fewer 
10 segments are more interpretable, and the complexity of the segmentation 
description. One can often drastically reduce the number and complexity of 
segmentations in a strategy for a relatively small performance degradation. It 
is usually important in strategy situations, that the segmentations be easily 
interpretable. 

15 

Strategy Design 

Strategy design is primarily achieved with human expertise. Strategies 
typically are developed by strategy consultants with significant, specific 
20 industry experience in designing and applying strategies. 

Scores can be extremely important to optimal strategy design. For example, 
in the simple situation, e.g. account acquisition, where there is a binary 
decision, e.g. accept/reject, and only one variable, e.g. good/bad, that directly 
25 impacts the objective, e.g. profit, a sound decision-theoretic result is that the 
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decision can be based on thresholding of the risk score. The exact score 
threshold that is used depends on the relative costs of acceptance of a bad 
applicant and the benefits of accepting a good applicant. 

5 There are now significantly more complicated prediction and strategy design 
situations. The need to handle customer-based, in addition to account-based, 
predictions and the need to predict bottom-line performance, e.g. profit, rather 
than components of profit, e.g. risk, revenue, and attrition, are two major 
challenges for the development of prediction technology. Users are not only 

10 faced with the need to adapt to'the new predictors that are developed but also 
to new and rapidly changing decision areas. For example, "How can a 
company help it's clients make real-time decisions to cross-sell one or more 
products in a large range of product offerings given all that the client knows 
about a particular customer?" 

15 

While human expertise is certainly a critical component of strategy design, as 
it is in scoring, bringing the same kind of analytic support to strategy design 
that has been applied to predictions can be a major technological leap 
forward. 

20 

In particular, strategy design is a problem of decision making under 
uncertainty. Because decision analysis is the primary technology for 
- optimization of decisions under uncertainty, decision analysis techniques and 
tools are relevant to strategy design. 

25 Description of Strategy Optimization 
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The strategy optimization invention disclosed herein is an overall architecture 
for optimal strategy design using both historical data and human expertise. In 
the following discussion, strategy optimization is described in terms of the 
5 kinds of objects it manipulates, the functions for transforming objects and the 
use cases which organize the functions into particular configurations. 

Objects 

The strategy optimization invention manipulates three major types of objects: 
10 strategies, data, and models. In addition the invention manipulates: decision 
keys, decisions, action scenarios, and variable functions. These entities have 
complex relationships. For example, data can be used to create variable 
functions. Data can be used to pass through strategies as they are created. 
Models can result in a strategy, through an optimization process, and can use 
15 strategies as input for a simulation process. Strategies can be built with 
historical data or strategy data. Strategies can also be built by an analyst 
without any data or any model. Strategies can be evaluated by simulation 
with a model or with historical data. 

Strategies 

20 

Strategies can be represented as a decision tree. Decision trees are 
segmented with decision keys and at least one action is associated with each 
leaf of the tree. 
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Strategies have a: Decision key set, Decision variables set, Action set, and 
Decision tree. 



M ode ls 

5 Influence diagrams are used in the preferred embodiment as the basis for a 
modeling language. Several modifications are made to the influence 
diagrams language to create the herein disclosed modeling language. 

Data Sets 

10 - 

The preferred embodiment makes use of two kinds of data sets: historical 
data and strategy data. A data set is an array of values. Each column in a 
data set has an identifier. 

Decision Keys 

15 

Decision keys are variables that are observed at the time of a decision and on 
which the strategy for that decision may be based. 

Decision keys have a categorical or continuous value range and a set of 
20 decisions for which they are relevant. 

Decision Variables 
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Decision variables represent actions under the control of the agent. Decisions 
have a categorical or continuous value range, a set of relevant decision keys, 
and a set of relevant action scenarios. 



5 Varigble Functions 

Variable functions predict the value of a variable based on a set of input 
variables. Variable functions are assessed both from expert judgement and 
learned from data. Variable functions can be the output of the action-based 
10 predictor module. 

Variable functions have three distinct sets of variables: a predicted variable, a 
set of input variables, and a function of the input variables. 

Functions 

15 

There are six major functions in the preferred embodiment: 

• Strategy simulation and evaluation 
20 • Strategy editing 

• Action based prediction 

• Model construction 

25 
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• Strategy optimization 

• Data-driven strategy development 

5 These functions are described below. 
Strategy Simulation and Evaluation 

The analyst can evaluate the segmentation tree of a strategy with respect to a 
given performance measure by applying it to an existing historical dataset. 
10 The criteria for evaluation include the purity of the segments, the number and 
complexity of the branches, the misclassification with respect to the 
performance measure, and the plausibility of the segmentation. This dafa 
based simulation is a component of current strategy analysis and also 
includes tools such as lift curves. 

15 

The preferred embodiment extends the simulation capabilities by providing 
simulation based on models only or on a combination of models and data. 
The simulation model can either be a decision model, i.e. an influence 
diagram, or a separate simulation, compatible simulation programe.gr. 
20 Portfolio Forecasting and Strategy System, a Fair, Isaac invention to forecast 
the portfolio results for alternative strategies. 

Strategy Editing 

The analyst can edit a decision tree in numerous ways including creating new 
25 trees from existing trees, creating and deleting subtrees, and copying and 
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pasting subtrees. The analyst can work with a single tree or with multiple 
trees at once. The analyst can start from scratch or from an existing decision 
tree, such as one created through strategy optimization. 

Data-driven Strate g y Development 

5 

The analyst can build a decision tree With interactive computer support: The 
preferred embodiment automatically suggests optimal segmentations with 
respect to a given historical performance dataset. Data-driven strategy 
development is achieved through the interactive induction of 
10 segmentation/decision trees from performance data or strategy data. 

Action-based Prediction O , ~ ' 

Action-based prediction is the development of a predictive model for a 
dependent variable that includes at least one actions, i.e. decisions, as 
15 independent variables. A variety of model-building methods are available, for 
example regression, score-engineered predictive models, Bayesian Network 
learning, and neural networks. The system allows the user to place 
constraints, i.e. score-engineer, on the model-building processes. 

20 

The preferred embodiment provides analysts with ways to guide the prediction 
process and to directly edit the predictors. The user is able to visualize and 
edit variable functions. Extensions of Optimal St rategy Design bv a Learning 
Strategy 
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The notion of optimal strategy applies to either deterministic or probabilistic 
strategies. Probabilistic strategies are often preferred when it is envisioned to 
re-optimize a decision model from time to time, each time based on new 

5 evidence from new data. Then it is possible to develop more optimal 
strategies at a later time, by maximizing the amount of information in the data, 
which are generated under the strategies deployed at an earlier time. Such 
data that are generated with the goal of learning in mind are designed data. 
One important goal of learning is the accurate inference of causal 

10 relationships. Strictly speaking, it is not possible to infer causal relationships 
without randomization. Randomized strategies present a solution to this 
problem. Another goal of learning is to explore a wide space of strategies so 
that it is possible to react swiftly to external changes. Randomized strategies 
can be designed to generate data, which allow for the development of an 

15 action-based predictive modeling module that predicts with accuracy and 
confidence over a wide operating range. A randomized strategy designed with 
these goals in mind is referred to as a learning strategy. A learning strategy 
generates designed data at minimum cost, and subject to business and 
operational constraints, including case and portfolio level constraints, 

20 constraints on test sample size, the maximum allowed number of test cells, 
and constraints on factor combinations making up for these test cells. 

A learning strategy is an optimal strategy running in parallel with such 
25 designed experiments. Tools from the science of experimental design, 
including full and fractional factorial designs, constrained optimal designs, 
screening designs, and response surface designs, are used to design 
experiments. The experiments are applied to a random fraction of accounts in 
such a way that these action values deviate from the optimal actions in a 
30 controlled fashion. Both the fraction of accounts submitted to experiments and 
the range of action values affect the short-term cost of experimentation and 
the long-term benefit from learning. 



WO 02/084925 



PCT/US02/11537 



The value of the learning strategy is calculated using a time-discounted 
cost/benefit analysis. The method requires assumptions on the prior 
distribution of the unknown true optimum and the risk aversion of the decision 
5 maker. Furthermore, it is possible to optimize the value of learning based on 
these assumptions. 

Model Construction 

10 Model construction is directed by an analyst through the user interface and 
involves problem framing, variable identification, model structuring, and 
variable function definition. 

Problem framing. The analyst determines the decisions to be made, the 
15 information known at each decision, Le. the decision keys, the objectives to 
be optimized, and the constraints to be met. The information structure 
determines how many stages, or discrete time periods, there are in the 
decision model. In a single stage model, the same information is known at all 
decisions, while in a multi-stage model, more information is known at later 
20 decisions than at earlier decisions. For instance, a model with a single 
decision is a single stage model. The number of decisions in each stage 
determines the dimensionality of the model. Multi-dimensional decision 
models have multiple decisions made at a single point in time. In credit card 
direct marketing, for example, decisions regarding offer parameters such as 
25 teaser rate and balance transfer rate constitute the multiple dimensions of the 
offer stage. 
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Variable identification. The analyst identifies variables relevant to the 
strategy. The analyst classifies variables as decision keys, e.g. scores, 
decisions, objective function, constraints, or intermediate variables. 



5 Model structuring. The analyst specifies the structure between the variables 
-using influence diagrams. Expert judgement is the primary method, but there 
are also Bayesian Network learning methods that could be used in certain 
situations. 

10 Variable function definition. The analyst quantifies the relationship between . 
the variables as conditional probability tables or as mathematical formulas. 
The analyst can use expert judgement or a data-based action-based 
prediction method such as regression, Bayesian network learning, or neural 
networks. 

15 

Strategy Optimization 

The analyst can find an optimal strategy for a decision situation expressed as 
a decision model. When there are no constraints in the objective, each 
20 combination of decision keys can be optimized separately: traditional 
influence diagram algorithms are sufficient to solve these individually. If there 
are global constraints, the optimizations for each combination of decision keys 
are coupled. 
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The preferred embodiment provides functionality for both types of problems. 
It can also induce a decision tree from a strategy dataset using the induction 
techniques of classification tree algorithms. 

Decision Engine Interface 

5 

The preferred embodiment writes strategies to and read strategies from 
decision engines. 

Use Cases 

10 

The following discussion lists five use cases envisioned for the strategy 
optimization invention and describes the tasks an analyst can perform under 
each. 

Strateg y Simulation 

15 

An analyst evaluates the structure, the performance by segment, and the 
overall performance of an existing strategy by applying it to historical data. 
This is also called strategy profiling. The analyst can compare multiple 
strategies at the same time. 

20 

By building a probabilistic model that incorporates the strategy, as well as 
other variables, the analyst can assess the impact of the strategy on higher- 
level objectives such as profit. 

hflanuaj Strategy Design 
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Fig. 5 shows the strategy design cycle for this use case. An analyst builds a 
decision tree manually using expert judgement. The analyst first specifies 
decision keys and at least one action scenario. Next, the analyst grows the 
5 tree by splitting branches one at a time, selecting a decision key and 
segmentation at each split. The analyst assigns at least one action scenario 
to the leaves of the completed tree. Finally, the analyst evaluates the 
effectiveness of the strategy using strategy simulation. Having evaluated the 
strategy, the analyst can fine tune the strategy and start the cycle again. 

10 

Strate gy De sign with Data 

Fig. 6 shows the strategy design cycle for this use case. An analyst builds a 
decision tree as above (manually), using historical data to help select 
15 segmentations and to profile tree segments. The analyst can accept the splits 
generated automatically by the strategy optimizer or he can edit them in any 
way. 

This use case exercises the data-driven strategy design and strategy editing 
20 functions 60 and could use the strategy simulation module also. 

Model-based Strategy Design 

Fig. 7 shows the strategy design cycle for this use case. An analyst creates a 
quantitative model using expert judgement that relates the decision keys, the 
25 decisions, the objective, and the constraints, using the model construction 
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function. The analyst then derives an optimal strategy from the model, using 
the strategy optimization function. The analyst can modify the strategy using 
the strategy editing function as in the previous use cases. The simulation 
function could be used to evaluate the strategy. This use case was 
demonstrated in research on risk-based pricing (see R. Oliver, R. Fung, Better 
Decision Strategies Through Data and Models with an Application to Risk 
Based Pricing, Fair, Isaac Technology Seminar (January 29, 1998); — 

Model-based Strategy De sign with Data 

Fig. 8 shows the strategy design cycle for this use case. This use case uses 
all the capabilities of the preferred embodiment. Given a data set, the action- 
based predictor function 80 is used to learn one or more of the variable 
functions in the model. 
15 

Comparing Model-based and Data-driven approaches 

In general, the model-based approach provides a significant leap in strategy 
performance because it offers the promise of approximately optimal 
20 performance. 

The data-driven approach is much easier to use because it is much closer to 
the ways things are done now. 
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The normal process of strategy development involves initial strategy 
development and then evolution of strategies through execution, 
experimentation, and performance analysis. Initial strategy development is 
usually done primarily with human expertise. The model-based module can 
5 be used to guide this process. Performance analysis can be achieved 
through the data-driven module. Both the model-based module and the data- 
based module can be used to derive new strategies with data. 

Given a particular segmentation and a model of the situation, a cardinal 
10 ordering of the decision alternatives can be derived for each segment. If 
many decisions are close to the optimal these should probably be tried and a 
single challenger strategy can be derived that simultaneously tries out these 
close alternatives for different segments. 

1 5 Questions about Model-based Analysis 

yv fmre dd the num bers come from? 

The model-based approach requires a significant number of structural and 
quantitative judgments from experts. While using expert judgment is now the 
20 only method by which strategies are developed, in scoring this is completely 
opposite - judgments are frowned upon, data is the only way to develop 
predictions. 

Mow does the invention use scores? 
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The model-based module must translate the relative risk ranking of scores to 
measure probabilities, i.e. odds. This is absolutely necessary for decision 
making and is clearly understood by strategy consultants. 

5 

How can decision situations with multip le part, i.e. vector, decisions, e.q, 
credit limit. APR, be handled by the architecture? 

This is straightforward. Each part of the decision is considered as a separate 
10 decision and sequenced according to the analyst's preferences. 

Can continuous decisio n spaces be handled? 

The prototypical decision is categorical and is the easiest to handle but 
continuous decision variables are possible. It preferably requires a fast 
15 nonlinear optimization routine because the form of the objective function is not 
known in general. Discretization of the continuous variable is another 
possibility. 

Although the invention is described herein with reference to the preferred 
20 embodiment, one skilled in the art will readily appreciate that other 
applications may be substituted for those set forth herein without departing 
from the spirit and scope of the present invention. Accordingly, the invention 
should only be limited by the Claims included below. 
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CLAIMS 

1 . An apparatus for strategy design and strategy analysis, comprising: 
one or more extensions to influence diagram methodology; 

a mechanism for translation of an influence diagram as a nonlinear 
optimization problem; and 

a mechanism for induction after optimization to derive decision rules. 

2. The apparatus of Claim 1, wherein said influence diagram methodology 
comprises any of a Bayesian network, Bayes' Net, and Bayesian model. 

3. The apparatus of Claim 1, wherein said extensions to influence 
diagram methodology comprise any of: 

a mechanism for effecting constrained optimization, which places 
constraints on an optimization in an influence diagram; 

a mechanism for effecting dataset-based optimization over an explicitly 
specified set of cases which comprise a representative population, as well as 
over a set of cases implicitly specified by joint prior distributions of decision 
keys; and 

a mechanism for mapping a decision model to a dataset file to map 
variables in an influence diagram to columns of a dataset 
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4. The apparatus of Claim 1 , wherein said mechanism for translation of 
an influence diagram as a nonlinear optimization problem allows an influence 
diagram to contain continuous variables with complex functional forms, a 
specification of high-dimensional representative populations, and constraints. 

5 

5. The apparatus of Claim 1, wherein said mechanism for induction 
derives simpler decision rules from many cases to allow for strategies that are 
more understandable, easier to edit, and easier to implement than at least 
one action provided in the raw optimization results. 

10 

6. A method for using strategy optimization, comprising the steps of. 
creating a strategy optimization influence diagram and a representative 

population to model a decision situation mathematically; 

formatting said strategy optimization influence diagram as a nonlinear 
15 optimization problem over a representative population; 

using nonlinear optimization techniques to assign at least one action 
to each case in a representative population; 

creating a strategy table containing all cases in said representative 
population, along with at least one assigned action for each case; 

20 applying induction techniques to said strategy table to derive a set of 

rules from data; and 

operationalizing a set of rules derived by induction through a decision 
engine. 
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7. The method of Claim 6, wherein said influence diagram comprises any 
of a Bayesian network, Bayes' Net, and Bayesian model. 

5 

8. The method of Claim 6, wherein said step of modeling a decision 
situation formulates and specifies a decision situation as a decision model, 
using a modeling language based on influence diagrams. 

10 9. The method of Claim 7, wherein said modeling language comprises 
any of the steps of: 

constraining optimization by associating a set of constraints with said 
decision model; 

performing dataset-based optimization over an explicitly specified set 
15 of cases; and 

mapping said decision model to a dataset file, wherein each decision 
key in said decision model is mapped with a column in a dataset. 

10. The method of Claim 6, wherein said step of formatting said strategy 
20 optimization influence diagram as a nonlinear optimization problem over a 
representative population formulates a nonlinear optimization problem that 
corresponds substantially exactly to a decision model specified in said 
modeling language; wherein an optimal solution to said decision model can 
optionally be determined by using integer programming. 

25 



46 



WO 02/084925 PCT/LS02/1 1537 

11. The method of Claim 9, wherein a decision model is used an input to 
said strategy optimization influence diagram. 



5 12. The method of Claim 10, wherein said step of formulating a given 
decision model as a nonlinear optimization problem further comprises the 
steps of: 

specifying cases in said representative population; 
constructing feasible actions for a decision for each case; and 
10 specifying an objective function of said optimization. 

1 3. The method of Claim .1 2, wherein said step of specifying cases in said 
representative population comprises the step of: 

selecting one of any of the following methods for specifying a set of 

15 cases: 

using a set of cases from a previously encountered decision situation; 

any of creating, designing, and specifying a set cases manually; 

drawing samples from a joint distribution of decisfon key variables in 
said decision model using Monte Carlo methods; wherein a joint probability 
20 distribution is defined over said decision key variables; and 

enumerating all possible cases; wherein a set of possible combinations 
of values of said decision keys is finite. 
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14. The method of Claim 12, wherein said step of constructing feasible 
actions for a decision for each case comprises the step of: 

notating a set of feasible actions for each case as AJ; wherein AJ is a 
5 function of case-level constraints; 

for each case cj instantiating said decision keys E according to that case; 

for each case-level constraint KJ computing a set Ajj of feasible 
decisions; 

setting AJ to be a set of all actions in A that are feasible for all constraints, 
10 such that , A J is the intersection of the sets A_ij; 

computing a set A_jj of feasible decisions by selecting a method 
appropriate to a type of case-level constraint, wherein: 

for constraints of type "If qJ(E), then x in SJ" computing a logical 
predicate qJ(E); 

15 if qJ(E)is true, then setting Ajj to SJ; 

if qJ(E)is false, then setting Ajj to A; wherein all possible actions are 
feasible; 

for constraints of type "hJ(E, x) < d J* setting Ajj to an empty set 
for each alternative action a in A computing a value of h J(E, a); 
20 if hJ(E, a) < d J, then adding a to Ajj. 
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15. The method of Claim 12, wherein said step of specifying an objective 
function of the optimization comprises the step of: 
natating said objective function by F(C,X,W); 

selecting a functional form for F(C,X,W), including any of: 

a sum of some function f(c, x(c)) over all cases in C, or Sum_(c in C) { 
w(c)*f(c,x(c))} 

a mean of some function f(c, x(c)) over all cases in C, or a ratio of 
Sum_(c in C) { w(c) M(c, x(c)) } to Sum_(c in C) { w(c) } 

j 

a mean or sum of some function f(c, x(c)) over a subset of cases in C; 

and 

a sum of some function f2(c, x(c), m) over all cases in C for all possible 
instantiations m of portfolio variables in U, or Sum_(c in C, m in M) { P(U=m) * 
w(c) * f(c, x(c), m) }. 

16. The method of Claim 12, wherein said step of specifying an objective 
function of the optimization comprises the step of: 

computing a component function f(c, x(c)), which is a sum of a value of 
value variables, evaluated at each case, for each case c by: 

setting said decision keys in said decision model according to c; 

then setting a decision according to x(c); 

then evaluating all variables in a network; and 

then evaluating a sum V_1 + V_2 + ... + V_N_V. 
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17; The method of Claim 15, wherein a resulting nonlinear optimization 
problem is stated as: 

selecting values for all variables x_i in X to maximize F(C,X,W); 

5 ^ subject to the constraints: 

xj is in A J (for all i in 1 to N_G); and 

G J(C,X,W) <= bj (for all j in 1 to NJT); and 

once formulated as a nonlinear optimization problem, optionally 
optimizing said decision model by using integer programming, 
10 wherein a strategy table is output with a row for each case cj in a 

representative population C, one column for each decision key of case c_i, 
and a column for an optimal action x*j assigned to case c_L 

18. The method of Claim 6, wherein said step of developing a strategy 
15 using induction on results of an optimization comprises the step of: 

using any of induction techniques which include Bayes Net Learning, 
and CART (Classification and Regression Trees). 

19. The method of Claim 6, wherein said step of developing a strategy 
20 using induction on results of an optimization comprises the step of: 

developing any of decision rules and strategies to express accurately 
how to assign at least one action to each case, based on its decision keys. 
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20. The method of Claim 6, wherein said step of developing a strategy 
using induction on results of an optimization comprises the steps of: 

finding a rule R defined over a target population, such that R(cJ) 
accurately predicts a value x*J for cases cj in a representative population C; 
5 and 

using a strategy table to help develop said rule R. 

21 . The method of Claim 20, wherein said rule R uses decision keys of c_i to 
either of compute or predict at least one assigned action. 

10 

22. The method of Claim 20, wherein a most accurate decision rule is a 
rule for which r(c_i) is the same as x*_j, for each cj in C. 

23. A method for strategy design and strategy analysis, comprising the 
15 steps of: 

providing one or more extensions to influence diagram methodology; 
translating an influence diagram as a nonlinear optimization problem; 

and 

induction.after optimization to derive decision rules. 

20 

24. The method of Claim 23, wherein said step of providing extensions to 
influence diagram methodology comprise any of the steps of: 

effecting constrained optimization by placing constraints on an 
optimization in an influence diagram; 
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effecting dataset-based optimization over an explicitly specified set of 
cases which comprise a representative population, as well as over a set of 
cases implicitly specified by joint prior distributions of decision keys; and 

mapping a decision model to a dataset file to map variables in an 
5 influence diagram to columns of a dataset. 

25. The method of Claim 23, wherein said step for translation of an 
influence diagram as a nonlinear optimization problem allows an influence 
diagram to contain continuous variables with complex functional forms, a 

10 specification of high-dimensional representative populations, and constraints. 

26. The method of Claim 23, wherein said step for induction derives 
simpler decision rules from many cases to allow for strategies that are more 
understandable, easier to edit, and easier to implement than an action 

15 provided in the raw optimization results. 

27. An apparatus for using strategy optimization, comprising: 

a strategy optimization influence diagram and a representative 
population for modeling a decision situation mathematically; 

20 a mechanism for formatting said strategy optimization influence 

diagram as a nonlinear optimization problem over a representative population; 

a mechanism for using nonlinear optimization techniques to assign at 
least one action to each case in a representative population; 
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a strategy table containing all cases in said representative population, 
along with at least one assigned action for each case; 

a mechanism for applying induction techniques to said strategy table to 
derive a set of rules from data; and 

5 a mechanism for operationalizing a set of rules derived by induction 

through a decision engine. 

28. The apparatus of Claim 27, wherein said influence diagram comprises 
any of a Bayesian network, Bayes' Net, and Bayesian model. 

10 

29. The apparatus of Claim 27, wherein said mechanism for modeling a 
decision situation formulates and specifies a decision situation as a decision 
model, using a modeling language based on influence diagrams. 

15 

30. The apparatus of Claim 29, wherein said modeling language 
comprises any of: 

a mechanism for constraining optimization by associating a set of 
constraints with said decision model; 
20 a mechanism for performing dataset-based optimization over an 

explicitly specified set of cases; and 

a mechanism for mapping said decision model to a dataset file, 
wherein each decision key in said decision model is mapped with a column in 
a dataset. 
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31. The apparatus of Claim 27, wherein said mechanism for formatting 
said strategy optimization influence diagram as a nonlinear optimization 
problem over a representative population formulates a nonlinear optimization 

5 problem that corresponds substantially exactly to a decision model specified 
in said modeling language; wherein an optimal solution to said decision model 
can optionally be determined by using integer programming. 

32. The apparatus of Claim 31, wherein a decision model is used an input 
10 to said strategy optimization influence diagram. 

33. The apparatus of Claim 31 , wherein said mechanism for formulating a 
given decision model as a nonlinear optimization problem further comprises: 

15 a mechanism for specifying cases in said representative population; 

a mechanism for constructing feasible actions for a decision for each 
case; and 

a mechanism for specifying an objective function of said optimization. 

20 34. The apparatus of Claim 33, wherein said mechanism for specifying 
cases in said representative population comprises: 

a mechanism for selecting one of any of the following mechanisms for 
specifying a set of cases: 
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a mechanism for using a set of cases from a previously encountered 
decision situation; 

a mechanism for any of creating, designing, and specifying a set cases 
manually; 

5 a mechanism for drawing samples from a joint distribution of decision 

key variables in said decision model using Monte Carlo methods; wherein a 
joint probability distribution is defined over said decision key variables; and 

a mechanism for enumerating all possible cases; wherein a set of 
possible combinations of values of said decision keys is finite. 

10 

35. The apparatus of Claim 33, wherein said mechanism for constructing 
feasible actions for a decision for each case comprises a mechanism for: 

notating a set of feasible actions for each case as A_i; wherein AJ is a 
function of case-level constraints; 
15 for each case c_l instantiating said decision keys E according to that 

case; 

for each case-level constraint KJ computing a set A_ij of feasible 
decisions; 

setting AJ to be a set of all actions in A that are feasible for all 
20 constraints, such that , A J is the intersection of the sets AJj; 

computing a set AJj of feasible decisions by selecting a method 
appropriate to a type of case-level constraint, wherein: 
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for constraints of type u lf qJ(E), then x in SJ" computing a logical 
predicate qJ(E); 

if qJ(E)is true, then setting AJj to SJ; 

if qJ(E)is false, then setting AJj to A; wherein all possible actions are 
5 feasible; 

for constraints of type "h J(E, x) < dj" setting AJj to an empty set 
for each alternative action a in A computing a value of h J(E, a); 
if hj(E, a) < dj, then adding a to AJj. 

36. The apparatus of Claim 33, wherein said mechanism for specifying an 
10 objective function of the optimization comprises a mechanism for 

notating said objective function by F(C,X,W); 

selecting a functional form for F(C,X,W), including any of: 

a sum of some function f(c, x(c)) over all cases in C, or Sum_(c in C) { 
w(c)*f( C) x(c))} 

15 a mean of some function f(c, x(c)) over all cases in C, or a ratio of 

Sumjc in C) { w(c) * f(c, x(c)) } to Sum Jc in C) { w(c) } 

a mean or sum of some function f(c, x(c)) over a subset of cases in C; 
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a sum of some function f2(c f x(c), m) over all cases in C for all possible 
instantiations m of portfolio variables in U f or Sum_(c in C, m in M) { P(U=m) * 
w(c) * f(c, x(c), m) }. 

5 37. The apparatus of Claim 33, wherein said step of specifying an objective 
function of the optimization comprises: 

a mechanism for computing a component function f(c, x(c)), which is a 
sum of a value of value variables, evaluated at each case, for each 
case c by: 

10 setting said decision keys in said decision model according to c; 

then setting a decision according to x(c); 
then evaluating all variables in a network; and 
then evaluating a sum V_1 + VJ2 + ... + V_N_V. 

15 

38. The apparatus of Claim 36, wherein a resulting nonlinear optimization 
problem is stated as: 

selecting values for all variables xj in X to maximize F(C,X,W); 

subject to the constraints: 

20 x_i is in A_i (for all i in 1 to N_C); and 

GJ(C,X,W) <= bj (for all j in 1 to NJT); and 

once formulated as a nonlinear optimization problem, optionally 
optimizing said decision model by using integer programming. 
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wherein a strategy table is output with a row for each case cj in a 
representative population C, one column for each decision key of case c_j, 
and a column for an optimal action x*_i assigned to case cj. 

5 39. The apparatus of Claim 27, wherein said mechanism for developing a 
strategy using induction on results of an optimization comprises: 

a mechanism for using any of induction techniques which include 
Bayes Net Learning, and CART (Classification and Regression Trees). 

10 40. The apparatus of Claim 27, wherein said mechanism for developing a 
strategy using induction on results of an optimization comprises: 

a mechanism for developing any of decision rules and strategies to 
express accurately how to assign at least one action to each case, based on 
its decision keys. 

41 . The apparatus of Claim 27, wherein said mechanism for developing a 
strategy using induction on results of an optimization comprises: 

a mechanism for finding a rule R defined over a target population, such 
that R(cj) accurately predicts a value x*J for cases cj in a representative 
20 population C; and 

a strategy table to help develop said rule R. 

42. The apparatus of Claim 41 , wherein said rule R uses decision keys of 
c_i to either of compute or predict at least one assigned action. 

25 
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• 43. The apparatus of Claim 41 , wherein a most accurate decision rule is a 
rule for which r(cj) is the same as x*_i, for each c_i in C. 



44. An apparatus for using strategy optimization, comprising: 
5 a strategy optimization influence diagram and a representative population 
for modeling a decision situation mathematically; and 

a mechanism for formatting said strategy optimization influence 
diagram as a nonlinear optimization problem over a representative population. 

10 45. The apparatus of Claim 44, wherein said influence diagram comprises any 
of a Bayesian network, Bayes' Net, and Bayesian model 46. An apparatus for 
using strategy optimization, comprising: 

a mechanism for using nonlinear optimization techniques to assign at 
least one action to each case in a representative population; and 

15 a strategy table containing all cases in said representative population, 

along with at least one assigned action for each case. 

47. An apparatus for using strategy optimization, comprising: 

a mechanism for applying induction techniques to a strategy table to 
20 derive a set of rules from data; and 
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a mechanism for operationalizing a set of rules derived by induction 
through a decision engine. 



48. A method for using strategy optimization, comprising the steps of: 

5 providing a strategy optimization influence diagram and a 

representative population for modeling a decision situation mathematically; 
and 

formatting said strategy optimization influence diagram as a nonlinear 
optimization problem over a representative population. 

10 

49. The method of Claim 48, wherein said influence diagram comprises 
any of a Bayesian network, Bayes 1 Net, and Bayesian model. 

50. A method for using strategy optimization, comprising the steps of: 

15 using nonlinear optimization techniques to assign at least one action 

to each case in a representative population; and 

providing a strategy table containing all cases in said representative * 
population, along with at least one assigned action for each case. 

20 51 . A method for using strategy optimization, comprising the steps of: 
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applying induction techniques to a strategy table to derive a set of rules 
from data; and 

operationalizing a set of rules derived by induction through a decision 
engine. 
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